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ALGEBRAIC METHODS TOWARD HIGHER-ORDER 
PROBABILITY INEQUALITIES, II 

By Donald St. P. Richards 

Pennsylvania State University 

Let (L, =4) be a finite distributive lattice, and suppose that the 
functions /lJjiL-tl are monotone increasing with respect to the 
partial order =5;. Given u a probability measure on L, denote by E(/;) 
the average of f% over L with respect to fi, i = 1,2. Then the FKG 
inequality provides a condition on the measure fi under which the co- 
variance, Cov(/i,/2) :=E(/i/a) — E(/i)E(/2), is nonnegative. In this 
paper we derive a "third-order" generalization of the FKG inequality: 
Let /1, /2 and /3 be nonnegative, monotone increasing functions on 
L; and let /i be a probability measure satisfying the same hypotheses 
as in the classical FKG inequality; then 

2E(/i/a/s) 

- pE(/i/ a )E(/a)+E(/i/s)E(/ a ) +E(/ 1 )E(/ a / s )] 

+ E(/ 1 )E(/ 2 )E(/ 3 ) 

is nonnegative. This result reduces to the FKG inequality for the case 
in which /35I. 

We also establish fourth- and fifth-order generalizations of the 
FKG inequality and formulate a conjecture for a general mth-order 
generalization. For functions and measures on R" we establish these 
inequalities by extending the method of diffusion processes. We pro- 
vide several applications of the third-order inequality, generalizing 
earlier applications of the FKG inequality. Finally, we remark on 
some connections between the theory of total positivity and the exis- 
tence of inequalities of FKG-type within the context of Riemannian 
manifolds. 

1. Introduction. In the realm of probability inequalities, the FKG in- 
equality, due to Fortuin, Kasteleyn and Ginibre (1971), now occupies a po- 
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sition of fundamental importance because of its simplicity and widespread 
applicability. Before we state in detail this remarkable inequality, it is worth- 
while to note some of its many applications. 

In statistics, the FKG inequality has appeared in the study of monotonic- 
ity properties of power functions of likelihood ratio test statistics in multi- 
variate analysis, association and dependence properties of random variables, 
and observational studies; see Sarkar (1969), Perlman and Olkin (1980), 
Eaton (1987), Karlin and Rinott (1980) and Rosenbaum (1995). 

In probability theory and mathematical physics, the FKG inequality has 
appeared in the areas of diffusion equations, interacting particle systems, 
Ising models, reliability theory and percolation; see Grimmett (1999), Herbst 
and Pitt (1991), Lebowitz (1972), Liggett (1985), Preston (1976), Simon 
(1974, 1979) and Glimm and Jaffe (1987). 

In work on total positivity and its connections with the theory of finite 
reflection groups and analysis on Lie groups, an analog of the FKG inequality 
was derived by Gross and Richards (1995). 

In combinatorial theory, the FKG inequality has appeared in work on 
the monotonicity of partial orders, Sperner theory, graph theory and Ram- 
sey theory; see Graham (1982, 1983), Engel (1997), Cameron (1987) and 
Seymour and Welsh (1975). 

We now state the inequality. Let L be a finite distributive lattice with 
partial ordering =<!, least upper bound V and greatest lower bound A. A 
function / : L — > M. is called (monotone) increasing if f(x) < f(y) whenever 
x =<! y. A probability measure [i on L is said to be multivariate totally positive 
of order 2 (MTP 2 ) if 

(1.1) fi(xyy)n(xAy)>fi(x)fj,(y) 

for all x,y € L. In some parts of the literature, an MTP2 probability measure 
fi is called an FKG measure or log-supermodular, we prefer the MTP2 termi- 
nology, reflecting the relationship with the classical theory of total positivity 
wherein (1.1) is an abstract formulation of the concept of total positivity of 
order 2. 

For any probability measure \i on L and any function / : L — ► R, denote 

by 

E(/) :=5>(a)/(a) 

the mean or average of / with respect to [i. 

Suppose fi and fi are both increasing (or both decreasing) real-valued 
functions on L, and let [i be an MTP2 probability measure on L. Then the 
FKG inequality provides that 



(1.2) 



Cov(/i,/ 2 ) := E(/i/ 2 ) - E(/0E(/ 2 ) > 0. 
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In mathematical statistics, it is usual to state the FKG inequality for the 
space R n . In that setting, for vectors x = (x\, . . . , x n ) and y = ( yi, . . . , y n ) 
in R n , the corresponding partial order is given by x ^ y if Xj < yj for all 
j = 1, . . . , n; the lattice operations V and A are 

x V y = ( max(xi , yi) , . . . , max(x n , y n ) ) 

and 

x Ay = (min(xi,yi), . . . , min(x n , y n )); 

a function f :M. n — > R is increasing if /(x) < /(y) whenever x =4 y; and a 
probability density function If : R n — > R, or its underlying random vector 
(Xx, . . . ,X n ), is said to be multivariate totally positive of order 2 (MTP2) if 

(1.3) K(xVy)K(xAy)>K(x)K(y) 

for all x, y € R n . If the functions fi,f 2 : R n — ► R are both increasing or both 
decreasing and the expectations E(fi), E{$2) and E(fif2), with respect to 
K, are finite, then the FKG inequality on R n provides that 



(1.4) 



fi{x)f 2 (x)K(x)dx 



h(x)K(x)dx^ ■ (J^J 2 {x)K{x)dx^ >0. 



It is well known that the continuous case of the FKG inequality, (1.4), can 
be deduced by an approximation argument from the discrete case, (1.2); see 
Karlin and Rinott (1980). We also remark that numerous generalizations 
of (1.2) have appeared in the literature; see Ahlswede and Daykin (1978), 
Batty and Bollmann (1980), Holley (1974), Preston (1974), Kemperman 
(1977), Edwards (1978) and Rinott and Saks (1993). These results typically 
provide conditions leading to inequalities of the form 

Y[fi{x)gi{x) -K(x)dx 
i=i 

~ ( JnMx)-K(x)dx\ ( Jj[g i (x)-K(x)dx ] j >0 

for classes of nonnegative functions /1 , . . . , fk and g\ , . . . , g^- 

In this paper we derive generalizations of (1.2) or (1.4) involving alter- 
nating sums with more than two terms. Recall that the covariance between 
two random variables is an example of a cumulant (or Ursell function) of 
those random variables, so the FKG inequality (1.2) provides an inequality 
for the simplest cumulant of the random variables f\ and f 2 - Hence, in a 
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search for "higher-order" generalizations of (1.2) involving several functions 
fi, f'2, ■ ■ ■ , it is natural that we study the higher cumulants. 

There are well-known probability distributions for which the FKG in- 
equality holds but for which the higher cumulants are nonpositive. Indeed, 
in the case of any Gaussian distribution, the higher cumulants are identically 
zero. It is then apparent that, without additional restrictions on the density 
function K, the cumulants themselves cannot provide direct generalizations 
of (1.2) [in this regard, we refer to Percus (1975) and Sylvester (1975) for 
other types of correlation inequalities for Ursell functions]. 

Nevertheless, the algebraic structure of the higher-order cumulants pro- 
vides crucial motivation for generalizing the FKG inequality. To explain, 
recall that the alternating sum 

K3(/l,/2,/ 3 ) :=nhhh) 
(1.5) - pE(/i/ 3 )E(/ 3 ) +E(/i/ 3 )E(/ 2 ) +E(/ 1 )E(/ 3 / 3 )] 

+ 2E(/0E(/ 2 )E(/ 3 ) 

is a cumulant of the random variables f±, / 2 and / 3 [cf. Speed (1983)]. We 
define the third- order conjugate cumulant 

4(/i,/2,/3) :=2E(/i/ 2 / 3 ) 
(1-6) - [E(/i/ 2 )E(/ 3 ) + E(/i/ 3 )E(/ 2 ) + E(/ 1 )E(/ 2 / 3 )] 

+ E(/i)E(/ 2 )E(/ 3 ), 

which is derived from k 3 by "reversing" the order of the absolute value of the 
coefficients appearing in (1.5). The general mth-order conjugate cumulant 
will be defined in a similar manner. Our first main result, proven in Section 2, 
is the following. 

Theorem 1.1. Let L be a finite distributive lattice, let fj, be an MTP 2 
probability measure on L, and let fi, / 2 and / 3 be nonnegative increasing 
functions on L. Then 

(1-7) 4(/i>/2,/ 3 )>0. 

As a consequence of Theorem 1.1, we obtain a lower bound on the cumu- 
lant K 3 (/i,/ 2 ,/ 3 ). 

Corollary 1.2. Under the same hypotheses as Theorem 1.1, 



« 3 (/i,/2,/3) > -[E(/i/ 2 / 3 )-E(/ 1 )E(/ 2 )E(/ 3 )]. 
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That the inequality (1.7) generalizes the FKG inequality can be seen in 
three ways. First, (1.7) reduces to (1.2) if / 3 = 1. Second, (1.7) does not 
generally reduce to (1.2) if, say, / 3 is the indicator function of a proper 
subset of L. Third, it is straightforward to verify that (1.7) can be rewritten 
as 

CovCA/a, / 3 ) - E(/0 Cov(/ 2) / 3 ) + Cov(/i/ 3) / 2 ) > 0, 

expressing the property that an alternating linear combination of nonneg- 
ative covariances is nonnegative. It is also a pleasant surprise that these 
results hold under the same hypotheses on K required for the classical FKG 
inequality, so that the only additional assumptions required are the existence 
of the various expectations. 

With Theorem 1.1 in place, it is natural to ask for similar generalizations 
of the FKG inequality involving four or more functions. To that end, we 
shall prove the following results. 



Theorem 1.3. Let fj> be an MTP 2 probability measure on L and let f{, 
i = 1, . . . , 5, be nonnegative increasing functions on L. Then the fourth- and 
fifth- order conjugate cumulants, 

4 := 6E(/!/ 2 /3/ 4 ) 
(1.8) - 2[E(/ 1 / 2 / 3 )E(/ 4 ) + •••]- [E(A/ 2 )E(/ 3 / 4 ) + • • •] 

+ [E(/,/ 2 )E(/ 3 )E(/ 4 ) + •••]- E(/ 1 )E(/ 2 )E(/ 3 )E(/ 4 ) 

and 

k' 5 :=24E(/ 1 / 2 / 3 / 4 / 5 ) 

- 6[E(/ 1 / 2 / 3 / 4 )E(/ 5 ) + •••]- 2[E(/ 1 / 2 / 3 )E(/ 4 / 5 ) + • • •] 

(1.9) 

+ 2[E(/ 1 / 2 / 3 )E(/ 4 )E(/ 5 ) + •••] + [E(/ 1 / 2 )E(/ 3 / 4 )E(/ 5 ) + • • •] 

- [E(/ 1 / 2 )E(/ 3 )E(/ 4 )E(/ 5 ) + •••]+ E(/ 1 )E(/ 2 )E(/ 3 )E(/ 4 ), 
are nonnegative. 

In the above results, the notation, for example, "E(/i/ 2 )E(/ 3 )E(/ 4 ) + 
• • ■" is shorthand notation for the sum over all distinct terms consisting of 
products of expectations of the set of functions {/i,...,/ 4 }, divided into 
subsets of sizes 2, 1, and 1. Explicitly, 

E(/i/ 2 )E(/ 3 )E(/ 4 ) + ■■• 

= E(/!/ 2 )E(/ 3 )E(/ 4 ) + E(/!/ 3 )E(/ 2 )E(/ 4 ) + E(/,/ 4 )E(/ 2 )E(/ 3 ) 

+ E(/i)E(/ 2 / 3 )E(/ 4 ) + E(/ 1 )E(/ 2 / 4 )E(/ 3 ) +E(/x)E(/ 2 )E(/ 3 / 4 ). 
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In Section 2 we shall prove Theorem 1.1. Once this is complete, the proof 
of Theorem 1.3 is seen to be a consequence of elementary, but lengthy, 
algebraic calculations which are easily performed using computer algebra 
software, for example, MAPLE. Further, we shall conjecture a generalization 
of Theorems 1.1 and 1.3 to the case of an arbitrary number of functions. 

In Section 3 we apply the method of diffusion processes to provide another 
approach to the generalized FKG inequalities. We extend a method due to 
Herbst and Pitt (1991) who derived the FKG inequality in their work on 
diffusion semigroups which are stochastically monotone and which preserve 
the class of positively correlated measures on M. n . While the methods utilized 
in Section 2 are patently algebraic in flavor, the methods used in Section 3 
may initially appear to be analytic in nature; however, a deeper reading will 
also reveal the ideas of this latter section to be as algebraic in characteristic 
as are those of Section 2. 

In Section 4 we approach the problem of proving Theorem 1.1 through 
the method of duplicate variables; this approach seems significantly more 
complex than in the case of the classical FKG inequality, and we provide 
explicit details for the case in which n = 1. In Section 5 we provide a number 
of applications, extending some well-known applications of the classical FKG 
inequality. Finally, in Section 6 we remark on connections between the theory 
of total positivity and the possible existence of inequalities of FKG-type 
within the context of weakly symmetric Riemannian manifolds. 

2. Finite distributive lattices. Let L be a finite distributive lattice with 
partial order =<!, least upper bound V and greatest lower bound A. Recall that 
every finite distributive lattice is order-isomorphic to the lattice of subsets of 
a finite set. Then there exists a finite set A such that L = 2 A , the collection 
of all subsets of A. Endowed with set inclusion as the partial order, and 
with union and intersection as least upper bound and greatest lower bound, 
respectively, 2 A is a finite distributive lattice. 

A probability measure fi on 2 A is said to be multivariate totally positive 
of order 2 (MTP 2 ) if, for all a,bCA, 

(2.1) fj,(a\Jb)fji(anb)>fi(a)fj,(b). 

A function f:2 A — >M is called (monotone) increasing if f(a) > f(b) when- 
ever a, b C A satisfy a D b. We denote the expected value of / with respect 
to the measure \x by 

E(/) := £ M (a)/(a). 

aCA 

We now establish (1.7). 

Proof of Theorem 1.1. Our argument follows that of den Hollander 
and Keane (1986). Without loss of generality, we assume that L = 2 A for 
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some finite set A. If the cardinality of 2 A is 1, that is, A = 0, then (1.7) 
holds trivially; therefore we may suppose that A is nonempty. 

Let us first assume that /u(a) > for all a C A. Choose and fix B, an 
arbitrarily chosen subset of A. For any aCB, define 

(2.2) /Ma):= £ /i(aU6) 

and, for any function / : 2 A — > R, define 

(2.3) / B ( a ) : =^_ £ M (oU6)/(oU6). 

As observed by den Hollander and Keane (1986), it is clear that hb is the 
marginal probability measure on the lattice 2 B ; also, /s(a) is the conditional 
expectation of /, given a QB. It can be shown [cf. den Hollander and Keane 
(1986), page 171] that /i# is MTP2, and that is increasing if / is increasing 
[see Eaton (1987) or Karlin and Rinott (1980) for similar results in the case 
of R n ]. 

For any function g : 2 B — > R, we define 

aCB 

In the sequel we shall need the double expectation theorem: For any B C. A, 

(2.4) E(/) = Yl Ka)f(a) = £ mb(o)/b(o) = E b (/b). 

aCyl aCB 

In words, the expected value of / equals the expected value of its conditional 
expectations. 

To establish (1.7) it suffices, as observed by den Hollander and Keane 
(1986) in the case of the FKG inequality, to assume that B = A \ {z}, where 
z € A is chosen arbitrarily; this amounts to a proof by induction on the 
length of maximal chains in the lattice 2 A . Using the shorthand notation 
fiB to denote (fi)B, i = 1,2,3, we claim that 

2E b ((/ 1 / 2 / 3 )b) 

(2.5) - [Eb((/i/ 2 )b/ 3 b) +E b ((/i/ 3 )b/ 2 b) +Eb(/ib(/ 2 / 3 )b)] 
+ Eb(/ib/ 2 b/ 3 b)>0. 

By (2.2) and (2.3) we have 

(2.6) fi B {a) = n{a) + n{aU{z}) 
and 

(2.7) f B (a) = -L-(jjL(a)f(a) + fi(a U {z})/(a U {z})) 

fj,B{a) 
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for aC B. By (2.7) we have 

HB(af{hf 2 h)B{a) 

(2.8) =fi B (a) 2 [fi(a)f l (a)f 2 (a)Ma) 

+ fi(a U (a U {z})f 2 (a U {*})/ 3 (a U {z})]. 

Further, for {i, j, k] = {1, 2, 3}, 
^B(a) 3 (/i/j)s(a)/fcB(a) 

(2.9) = MB(a)Ma)/i(a)/j(a) + m(« U {z})fi(a U {*})/,• (a U {z})] 

x [M(a)/ fc (a) + ^x(a U {z})f k {a U {z})] 

and 

^B{a) 3 f 1B {a)f 2B (a)f 3B (a) 

(2.10) 

= E[ + M« U {z})/,(a U {*})]. 

i=l 

Collecting together (2.8)-(2.10) and performing elementary algebraic sim- 
plifications with the aid of (2.6) and (2.7), we obtain 

H B {af{2{hf 2 h) B {a) 

- [Uih)B(a)h B {a) + tfih) B {a)f 2B {a) + / 1B (a)(/ 2 / 3 ) B (a)] 

(2.11) +/ib(o)/ 2 b(o)/ 3 b(o)} 
= /'(«)/'(ttU{z}) 

x [(Ma U {z}) - /!(a))$i B (o) + /i(a U {z})* 2 fl(a)], 

where 

$ 1B (a) = /x(a)(/ 2 (a U {z})/ 3 (a U {z}) - / 2 (a)/ 3 (a)) 

(2.12) + M (a U {z})/ 3 (a)(/ 2 (a U {z}) - / 2 (a)) 
+ n(a U {z})/ 2 (a)(/ 3 (a U {z}) - / 3 (o)) 

and 

$ 2B (a) = (fi(aU {z}) + fi(a)) 

(2.13) 

x (f 2 (a U {z}) - / 2 (o))(/ 3 (a U {z}) - / 3 (o)). 

Since each /j is nonnegative and increasing, it follows that (2.12) and (2.13) 
are sums of products of nonnegative terms; hence (2.11) is nonnegative. 
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Next we divide both sides of (2.11) by fi B (a)' 2 and sum over all aCB. 
We have 

E Mfl(a)(/i/2/s)B(a) =E b ((/i/ 2 / 3 )b) =E(/i/ 2 / 3 ), 

aCB 

the latter equality following from the double expectation theorem (2.4). For 
{i,j, k} = {1, 2, 3}, we have 

E VB{a)(fif j )B{a)fkB(a) = E B ((f i f j ) B f kB ); 

a<ZB 

and also 

E VB{a)flB{a)f2B(a)f3B(a) = E B (f 1B f 2B h B ). 

aCB 

Collecting these identities together and applying the nonnegativity of (2.11), 
we obtain (2.5). 

Since B was chosen arbitrarily, we may set B = in (2.5). By (2.2) 
and (2.3) we have E (f i0 ) = E(/i), E ((f i f j ) f k0 ) = E(f i f j )K{f k ), where 
= {1,2,3}, and also E (/i / 20 / 30 ) = E(/ 1 )E(/ 2 )E(/ 3 ). Then (2.5) 
reduces to (1.7). 

Finally, the case in which fj, is not everywhere positive is resolved as in the 
case of the FKG inequality [see den Hollander and Keane (1986), page 172]. 
That is, we can carry through the above arguments once \i B and the fi B are 
defined on the set {a C B : fJ. B (a) ^ 0}; this is done in any way which ensures 
that all fiB are increasing on 2 B , and the actual choice is immaterial since 
all we need is that each fc B is increasing on the support of [i B . □ 

Remark 2.1. (i) The assumption that the functions /i, / 2 and / 3 are 
nonnegative is essential; in particular, we cannot avoid this assumption by 
adding a constant to each function, for k 3 (/i,/2,/3) is not invariant under 
such shifts. A referee has noted that without the positivity assumption, 
E(X 3 ) can be made arbitrarily negatively large relative to other moments, 
and this would violate the inequality for fi(x) = x, i = 1,2, 3. 

(ii) Consider the range of values of positive coefficients c\ , c 2 and c 3 such 
that 

ciE(/i/ 2 / 3 ) - c 2 [E(/ 1 / 2 )E(/ 3 ) +E(/i/ 3 )E(/ 2 ) +E(/i)E(/ 2 / 3 )] 
+ c 3 E(/ 1 )E(/ 2 )E(/ 3 ) > 

for all nonnegative increasing functions f\, / 2 and / 3 . For simplicity, let us 
work with functions on M. 2 . In order to make comparison with Theorem 1.1, 
we obviously need to impose the restriction 



ci - 3c 2 + c 3 = 0. 



10 



D. ST. P. RICHARDS 



Without loss of generality, we may assume c 2 = 1. By setting / 3 = 1 we 
obtain 

(ci - l)E(/i/ 2 ) - (2 - c 3 )E(/i)E(/ 2 ) > 0. 

In order to maintain positive coefficients, we must impose the additional 
restriction c\ > 1. 

For j = 1,2,3, let Oj,6j € R and let /, be the indicator function of the 
set {(u,v) £ R 2 :u > a,j,v > bj}. Denote the underlying random vector by 
(X,Y); by choosing suitable distributions for (X,Y) it can be shown that 
the condition c\ > 1 is not sufficient to ensure that ^(/i, /2> Z^) > 0. For 
instance, suppose a\ < a 2 < «3 and &i < & 2 < 63. Denoting P(X > Oj , y > 67) 
by 7Tj, we obtain 

«3(/l> /2) /3) = Cl7T 3 - (27r 2 7r 3 + 7Ti7r 3 ) + c 3 7ri7r 2 7r 3 

EE 7T 3 [(1 " 7T 2 )(2 + (CI - 3)7Tl) + (CI " 2)(1 - TTl)]. 

For ci > 1 we then have 

4(/i 5 /2,/ 3 )>7r 3 (l-7r 1 )(l-27r 2 ). 

For suitable (X,Y) it is possible to attain this lower bound; moreover, the 
bound can be negative. Therefore the restriction c\ > 1 is not sufficient to 
ensure nonnegativity of k 3 (/i,/ 2 ,/ 3 ) for all nonnegative increasing fj. By 
choosing the fj from among the class of indicator functions of the type 
above, we can deduce that c\ > 2 is sufficient. We will return to this theme 
in Section 6. 

Returning to the general context, a close inspection of the proof of The- 
orem 1.1 shows that we have also obtained a collection of lower bounds for 
ECfi/2/3). 

Corollary 2.2. Let n be an MTP 2 measure on 2 A and let /1, / 2 and 
/ 3 be nonnegative increasing functions on 2 . For any B Q A, there holds 
the lower bound 

2E(/i/ 2 / 3 ) >E B ((/ 1 / 2 ) B / 3B ) + E B ((/i/ 3 ) B / 2 s) 

(2.14) 

+ Eb(/ib(/2/s)b) -E B (/ 1B / 2B / 3B ). 

Remark 2.3. In the case of the FKG inequality, that is, for the case in 
which / 3 = 1, the left-hand side of (2.5) reduces to 

(2.15) E b ((/i/ 2 )b) - E B (/i B / 2B ) = E(A/ 2 ) - E B (f 1B f 2B ). 

In addition to establishing the FKG inequality, den Hollander and Keane 
[(1986), Theorem 4(b)] establish the sharper inequality that (2.15) is a de- 
creasing function of B £2 A . This raises the issue of whether the left-hand 
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side of (2.5) satisfies similar monotonicity properties. Even for the simplest 
lattices, such monotonicity properties appear difficult to discern. In particu- 
lar, as the following counterexample shows, the left-hand side of (2.5) is not 
generally monotonically decreasing in B. 

Suppose that A = {w,z}, a set with two distinct elements. The corre- 
sponding lattice is 2 A = {0, {w}, {z}, {w, z}}. Define a probability measure 
H on 2 A by: /i(<£) = 1/2, fi({w}) = f^({z}) = 1/8 and fi{{w,z}) = 1/4; then 
it is straightforward to verify that fi is MTP2. Define three nonnegative 
increasing functions fx, fi and f 3 on 2 A by the substitutions 

if a = 0, 
if a = {w}, 
if a = {z}, 
if a = {w, z}, 

where ai, Pi,ji,6i >0,i= 1,2,3. If we denote by g{B) the left-hand side of 

(2.5) , then for the case in which (01,02,03) = (fix, P2, Pz) = (1,2,3), (71,72,73) = (4,5,6) 
and (Sx, 62,63) = (0.1,0.2,0.3), a straightforward calculation (carried out us- 
ing MAPLE) reveals that g(0) — g({w}) < 0. On the other hand, for the 

case in which cti = (5% = -fi = 6i = i, i = 1, 2, 3, we have g(0) — g({w}) > 0. 

Remark 2.4. Before we turn to the proof of Theorem 1.3, let us re- 
view the algebraic calculations appearing in the proof of Theorem 1.1. In 

(2.6) and (2.7), introduce the notation n,, = fi(a U {z}), V{ = fi(a), i = l,2, 3; 
and let uix = /u(a U {z}) and u>2 = M a ) be nonnegative weights. With u = 
(ux,v,2,us) and v = (vx, 112,1)3) as variables, define the polynomials 

P(2,1){U]V) := (UJX + W 2 )[(^l«l«2 + UJ 2 VXV 2 )(UJXU 3 + LJ2V3) 

+ (0JXUXU3 + 0J 2 VxV?,){uxU2 +UJ2V2) 
+ (UJXU2U3 + U)2V 2 V3)(CJXUX +^2«l)]> 
P(1,1,1){U]V) := (uJxUx+u;2Vx)(uxU2+UJ2V2)(uiU3+UJ2V 3 ). 

Also define 

:= 2p {3) (u;v) - P( 2 ,i)(u;v) +P(i,x,i)(u',v), 

which is precisely the left-hand side of (2.11). Then the nonnegativity of 
(2.11) is equivalent to the nonnegativity of $(u;v) under the restrictions 
that m > Vi > 0, i = 1,2,3. Equivalently, to establish (2.5), we only have 
to show that the polynomial <&(u + v; v ) is nonnegative under the restric- 
tions Ui > 0, Vi > 0, i = 1,2,3. However, the package MAPLE produces the 




OLi, 

Oii + Pi, 

OLi +7i, 

o-i + Pi + H + 6i, 
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stronger result that, in the monomial expansion of 3>(u + v;v), all the coeffi- 
cients appearing are nonnegative; in fact, MAPLE calculates the monomial 
expansion of Q(u + v; v) to be 

<&{U + V,V) = U>llV2(2VlV2V 3 + ViV 2 U 3 + VlU 2 V 3 + UiV 2 V 3 ) 

+ 0Jl0jl(viV2V3 + V1V2U3 + V 1U2V3 + U1V2V3). 

Now it becomes clear that generalizations of Theorem 1.1 can be estab- 
lished in a similar manner. Indeed, any conjectured generalization involv- 
ing m functions fi, ■ ■ ■ , f m will be valid if, with u = (u\, . . . , u m ) and v = 
(yx, . . . , v m ), the coefficients in the monomial expansion of the correspond- 
ing polynomial &(u + v; v) are all nonnegative. This is the approach we 
adopt to establish the nonnegativity of the fourth- and fifth-order conjugate 
cumulants. 

PROOF of Theorem 1.3. To show that (1.8) is nonnegative, we follow 
the same approach as in the case of Theorem 1.1. To initiate the proof by 
induction, it is straightforward to verify that the result is valid for the case 
in which A = . 

Now we turn to the inductive hypothesis for nonempty A. In the case of 
four increasing functions fi, i = 1,2, 3, 4, the claim analogous to (2.5) is that, 
for any B <Z A, 

6Eb((/i/ 2 / 3 / 4 )b) - 2[E b ((/i/ 2 / 3 )b/4b) + • ■ •] 

(2.16) _[ Eb ((/ 1 / 2 ) b (/ 3 / 4 ) s ) + ...] 

+ PEb((/i/ 2 )b/ 3 b/4b) + ' ' '] +EB(/lB/2B/3B/4B) > 0. 

Proceeding as in Remark 2.4, we apply MAPLE to verify that all coefficients 
in the monomial expansion of the corresponding polynomial $(u + v\v) are 
nonnegative. Once this has been done, the remainder of the proof follows 
the arguments given in the latter part of the proof of Theorem 1.1. 
To prove that (1.9) is nonnegative, we begin with the claim that 

24Eb((/i/ 2 /3/4/ 5 )b) - 2[E b ((/ 1 / 2 / 3 / 4 )b/5b) + • ■ •] 

-[Efl((/i/ 2 / 3 )B(/4/6)fl) + ---] 

(2.17) - [E s ((/ 1 / 2 / 3 ) s /4b/5b) + • • •] 
+ [Eb((/i/ 2 )b(/ 3 / 4 )b/ 5 b) + ---] 

+ [Eb((A/ 2 )b/3b/ 4 b/5b) + •••]" EB(/lB/ 2 B/3B/4B/5B) > 

for any B C A. Next we construct the corresponding polynomial <&{u;v), 
apply MAPLE to verify the nonnegativity of all coefficients in the monomial 
expansion of + v; v), and then the remainder of the proof is as before. 

□ 
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To formulate a conjecture for the case of m increasing functions, we need 
some preliminaries from the theory of partitions [see Macdonald (1995)]. 

A partition A = (Ai,A2, ■•■) is a sequence of nonnegative integers with 
Ai > A2 > • • • • The parts of A are the nonzero A*; the weight of A is |A| := 
Ai + A2 + • • • ; and the length of A, denoted by l(X), is the number of parts 
of A. 

Given a partition A, for each 2 = 1,2,..., let A- denote the cardinality of 
the set {j : Xj >i}. Then X[ > X' 2 > • • • , and the partition A' = (A' l5 A' 2 , • • • ) is 
called the partition conjugate to X. It is not difficult to verify that (A')' = A 
and that X\ = l(X). 

For m £ N, m > 2, let <3 m denote the symmetric group on m symbols. 
For any permutation r € & m and any vector u = (u\, . . . ,u m ) € M. m , define 
t ■ u:= (u r (!), . . . ,u T ( m )), the standard action of 6 m on W 71 . 

Let fi , . . . , f m be functions on the lattice L and let /* be a probability 
measure on L. For any partition A = (Ai, . . . , A m ) of weight m, define 

iW / ^ \ 

i+-+Xj-i+k I ) 
j=l \k=l ) 

where expectations are with respect to the measure \i. We denote by D(X) 
the set of all r E S m which give rise to distinct permutations V\(t • . . . , f m )) 
of V\(fi, . . . , f m ). Then our conjecture for an mth-order generalization of the 
FKG inequality is the following: 

Conjecture 2.5. Let \i be an MTP2 probability measure on the finite 
distributive lattice L, and let fx,-. - ,f m be nonnegative increasing functions 
on L. Then 

(i) There exists a set of nonzero constants {c\ G Z : |A| = m} such that 

V m (fl,...,fm):= E C A E n(T-(fl,...Jm)) 

|A|=m rGD(A) 

is nonnegative. 

(ii) For each m>3, there exists a constant d m such that 

(2.18) V m (fl, ■ ■ ■ ,/m-l, 1) = dmVm-lU^ • ■ -,/m-l)- 

(iii) If fj = 1 for all j = 1, ... , m, £/ien 'P m (l, . . . , 1) = 0; equivalently, 



(2.19) 



E card( J D(A))c A = 0. 
|A|=m 
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For general to, the mth-order cumulant of a set of random variables 
/i, . . . ,f m is 

K m(/l; • • • > fm) 

(2.20) 

:= £ (_i)KA)-i(/(A)_i)! Pa(t-(/i,...,/ w )), 

|A|=m re-D(A) 

and we define the mth-order conjugate cumulant 
(2.21) 

|A|=m re-D(A) 

For m = 2, 3, 4, 5, the conjecture is valid if we choose for "P m the conjugate 
cumulant K f m . For these values of m, the coefficients ca = (— 1)'^ -1 (Z(A') — 
1)!, |A| = m, in the expansion of K r m (fi, • • • , /m) are an nonzero and satisfy 
(2.19). Also, it can be verified that, for m = 3,4, 5, the K' m satisfy (2.18) with 
d m = m — 2. In light of Theorems 1.1 and 1.3, all the preceding calculations 
provide evidence for the general conjecture. 

For m > 6, it appears that the conjugate cumulants do not satisfy (2.19). 
In fact, it appears to us that K>' m (l, . . . , 1) > for all to > 6. 

3. Diffusion processes. Once we have a generalization of the FKG in- 
equality within the context of finite distributive lattices, we can transfer 
that result to the context of functions and measures on M n using standard 
approximation procedures [see Karlin and Rinott (1980)]. In this section 
we give a direct approach, using the method of diffusion processes, to our 
generalizations of the FKG inequalities on W n . This approach is based on 
the ideas of Herbst and Pitt (1991) and we first present some preliminary 
material, all of which is abstracted from Herbst and Pitt (1991). 

We denote by C(R n ) the space of real- valued continuous functions on 
R n . Further, we define C b (R n ) = {/ 6 C(W n ) : WfW^ < oo} and denote by 
Cfr°(M. n ) the space of functions / which have bounded continuous derivatives 
of all orders. We also denote by M. the space of functions / G Cb(M n ) which 
are increasing. 

A probability measure /i on lR n is said to be 'positively correlated, or as- 
sociated, if 

/ fi(x)f 2 (x)dfi(x)> fi(x) dfj,(x) ■ f 2 (x)d[i(x) 
J«. n Jn. n Jm™ 

for all fi , /2 £ Ai. We denote by V the collection of all probability measures 
ji which are positively correlated. 
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Let {Pf : t > 0} = {P(t; x, dy) : t > 0} be a Markov transition semigroup of 
probability measures on IR n . Assume that {Pt'-t > 0} is Feller-continuous; 
that is, the operators Pt, defined by 

P t f(x):= I f(y)P(t;x,dy), 

map the space C(,(IR n ) into itself. The semigroup {Pt : t > 0} is called mono- 
tonic, or is said to leave A4 invariant, if, for all t > 0, Ptf £ M whenever 
f £ A4. The semigroup {P( :t > 0} also acts on measures by 

nP t (A):= f fi(dx)P(t;x,A)= f f I A {y)^{dx)P{t;x,dy), 

JR" JM" JR" 

where Ia{') denotes the indicator function of the measurable set A; equiva- 
lently, for any / £ Cb(M ri ), the measure \xPt is defined by 

/ f(x)dfiP t (x)= f f f(y)dfi(x)P(t;x,dy). 

JR" JR" JR" 

If ^Pt £ P for all t > whenever \i£V, then we shall say that {Pt : t > 0} 
preserves positive correlations. Thus {P( : t > 0} preserves positive correla- 
tions if 

(3-1) ^PtUih) ~ (^P/i)(/iP/ 2 ) > 

for all n £ V, t > and /i, f 2 £ M. 

The (strong) infinitesimal generator G of {Pt : t > 0} is the linear operator 
on Cf,(IR n ) given by 

(3.2) Gf(x):= lim Pg/(x) = f{x) , 

where the convergence is uniform in x. The domain T>(G) of G is the class 
of all functions / for which the limit exists. 

For each i = l,...,n, denote by di the partial derivative, d/dxi, with 
respect to re,, the ith coordinate of the vector x. 

Let a(x) = (aij(x)), x £ W 1 , be a symmetric positive semidefinite matrix- 
valued function on W 1 and let b(x) = (bj(x)) be a vector field on M n . Denote 
by G the differential operator 

n n 

(3.3) Gf(x) := \ a itj (x) d { d 3 f{x) + ^ b 3 {x) d s f{x) 

i,j=l j=l 

with domain V(G) = {/ £ C£°(R n ) : Gf £ C b (R n )}. 

Following Herbst and Pitt (1991), we call {Pt : t > 0} a diffusion semigroup 
with diffusion coefficients a(x) and b(x) if V(G) C and G/ = G/ for 

all / £ V(G). We will assume throughout that G generates {Pt : t > 0} in the 
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sense that there exists a unique diffusion semigroup {Pt : t > 0} correspond- 
ing to given coefficients a(x) and b(x); a sufficient condition under which G 
generates {Pt :t > 0} is given by Herbst and Pitt [(1991), (1.3)] [cf. Chen 
and Wang (1993)]. 

Applying the semigroup property P s +e = P s Pe an d (3.2), it is straightfor- 
ward to show that, for any / € T>(G), 

(3.4) §- s Psf(x) := ffin P '+^~ = G P s f{z) = P s Gf(x), 

with the limit holding uniformly in x. 

For /i, f'2 € C 1 (]R n ), we shall use the notation 

)( 

(3.5) T x {h,f 2 )(x) := J2 a iJ {x)d i h{x)djh{x), 

x € W 1 ; this operator is also known as the carre du champ operator [cf. Hu 
(2000)]. For /i,/ 2 £ V(G), it is straightforward to verify that 

(3.6) iM/L/aXs) = G(hf 2 ){x) - h(x)G{f 2 )(x) - f 2 (x)G(f 1 )(x), 

so that T± can be viewed as measuring the extent to which the operator G 
is a derivation. 

By combining the results of Herbst and Pitt (1991) and Chen and Wang 
(1993), we obtain the following criterion for monotonicity of the semigroup 
{Pf-t>0}. 



Theorem 3.1 [Herbst and Pitt (1991) and Chen and Wang (1993)]. The 
semigroup {Pt : t > 0} is monotonic if and only if 

(i) for all i,j = l,... ajj(x) depends only on X{ and xj, and 

(ii) for all j = 1, . . . , n, bj(x) > bj(y) whenever x > y with Xj = yj. 



A necessary and sufficient condition for preservation of positive correla- 
tions is the following result of Herbst and Pitt [(1991), Theorem 1.3]. 



Theorem 3.2 [Herbst and Pitt (1991)]. The semigroup {Pt :t > 0} pre- 
serves positive correlations if and only if 

(i) {Pt : t > 0} is monotonic, and 

(ii) a,ij(x) > for all x £ W 1 , i,j = 1, . . . ,n. 
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Theorem 3.1 can be viewed as providing conditions under which the semi- 
group {Pt : t > 0} preserves "second-order" correlations. In the sequel we 
show, generalizing (3.1), how {Pt : t > 0} preserves certain third-order corre- 
lations. The following result is motivated by Herbst and Pitt (1991), notably 
their Proposition 4.1, and the proof below is along the lines developed by 
them. 

Proposition 3.3. Suppose that the diffusion semigroup {Pt-t>0} is 
monotonia, that 

(3.7) Ti(fuMx)>0 

for all x € M n and for all smooth fi, fa G M, and that 04 € M for all i,j = 1, 
...,n. Then 

2P t (f 1 f 2 f 3 )(x) - [P t (fafa)(x)P t fa(x) 
(3-8) + P t (fafa)(x)P t fa(x) + P t h{x)P t {f 2 h){x)] 

+ P t fl{x)P t f2(x)P t f 3 (x)>0 

for all nonnegative smooth fi, f 2 , fa € A4 and all x £ M. n . 

Proof. Since the class of smooth functions is dense in A4 in the topol- 
ogy of bounded locally uniform convergence, it suffices to prove (3.8) for the 
case in which fa, fa, fa£Mn C£°{R n ). 

Now fix t > and define the function 

h(s) = 2P s (P t . s fa ■ P t - S fa ■ Pt- S fa)(x) 

- [Ps{Pt-sfl ■ P t --sfa){x) ■ Ptfa(x) 

(3.9) + P s (Pt- s fa ■ Pt-sfa)(x) ■ P t fa(x) 

+ P t fa(x)-P s (P t - s fa-P t -sfa)(x)} 

+ P t fa(x) ■ P t fa(x) ■ P t fa(x), 

0<s<t. Observe that h(0) = and 

h{t) = 2P t {fafafa)(x) 

-[Pt(fafa)(x)-P t fa(x) 

(3.10) 

+ Pt(fafa)(x) ■ Ptfa(x) + P t fa{x) ■ Pt(fafa){x)] 
+ P t fa(x) ■ P t fa(x) ■ P t fa(x). 

Thus, to establish (3.8), we need only show that h'(s) > for all s G (0, t). 
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We now apply (3.4) repeatedly to (3.9) to differentiate h. Suppressing the 
notational dependence of all functions on the argument x, we obtain 

h'(s) = 2[P s G(P t - s h ■ P t - S f 2 ■ P t - S h) - P s (GP t . s h ■ P t - S h ■ Pt-sh) 

- P s {Pt-sh ■ GP t - s f 2 ■ P t - S h) - P s (P t - s fl ■ P t -sf2 ■ GPt-sfs)] 

- [P s G{P t - s f X ■ P t - S f 2 ) 

- P s (GP t . s h ■ P t - S h) - P s (Pt-sfl ■ GP t -sh)]Pth 

- [P s G(P t - s h ■ Pt-sh) 

- P s {GP t -sh ■ Pt-sh) - Ps{Pt-sh ■ GP t -sh)]Pth 

- [P s G{P t - s f 2 ■ Pt-sh) 

- P s (GP t _J 2 ■ Pt-sh) - Ps(Pt-sh ■ GP t -sh)]Pth- 

Equivalently, denoting Pt~ s fi by gi, i = 1,2, 3, what we have shown is that 
h'(s) = 2[P s G( gi g 2 g 3 ) - P s (g2g 3 G gi ) - P s (gig 3 Gg 2 ) - P s (gig 2 Gg 3 )} 

- [PsG( gi g 2 ) - P s (g2G gi ) - Ps(giGg 2 )}P s g 3 

- [P s G(g m ) - P s (g 3 G 9l ) - Ps(giGg 3 )}P s g 2 

- [P s G(g 2 g 3 ) - P s {g 3 Gg 2 ) - P s (g 2 Gg 3 )]P s9l . 
Using the definition of Ti(-, •) in (3.5), we have 

h'(s) = 2P s [G(g 1 g 2 g 3 ) - g 2 g 3 Gg x - gig 3 Gg 2 - gig 2 Gg 3 ] 

(3.11) -[PsT^g^-Psgs 

+ P s F 1 (g 1 ,g 3 ) ■ P s g2 + PsFi{g 2 ,g 3 ) ■ P s g\). 

For {i,j,k} = {1,2,3}, we express each term P s Ti(gi, gj) ■ P s gk in (3-11) in 
the form 

P s ^l(9i,9j) ■ P s gk 

= P s (Ti( gi , gj ) -g k ) - [P s {Yi( gi , gj ) -g k ) -P s Ti(gi,gj) ■ P s g k ], 
and then we obtain 

ti (s) = P S T 1 ( gi ,g 2 ,g 3 ) + [P s (Ft ( 9l , g 2 ) ■ g 3 ) - P S T 1 ( 9l ,g 2 )- P s g 3 ] 

(3.12) + [P s (T 1 (g 1 , g 3 ) ■ g 2 ) - P.T^gs) ■ P s g 2 ] 
+ [Ps{Fi{g 2 ,g 3 ) -gi) - P s Ti(g 2 ,g 3 ) ■ P s gi], 

where 

Ti(gi,g2,g3) ■= 2[G(gig 2 g 3 ) - g 2 g 3 Gg\ - g\g 3 Gg 2 - g\g 2 Gg 3 \ 

(3.13) - [g 3 Ti{gi,g 2 ) +g 2 Ti(gi,g 3 ) + giTi(g 2 ,g 3 )] 

= ^G(g 1 g 2 g 3 ) - g±G(g 2 g 3 ) - g 2 G{gig 3 ) - g 3 G(gig 2 ). 
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By a direct calculation using (3.3) and (3.5), we obtain from (3.13) the 
identity 

n 

11(31,52,53) =gi ^2 a i,jdi92djg3 

n n 

(3.14) +92 ai >i 9 i9l djg-3 + 93 a-ij digi <% 2 

i,j=l i,j=l 
= 9l^l{92,g3) + 92^1(91,93) + 93^1(91, 92)- 

Since /1, / 2 and /3 are nonnegative, then so are g±, g 2 and 33. By the 
assumptions that {Pt'-t > 0} is monotonic and that (3.7) holds, we find 
that gi G M n C b °°(]R n ), i = 1,2, 3, and ft) > 0, 1 < k < I < 3; therefore 

Fi (91, 32, 33) > 0. 

Since each G M, then > for all i = 1, . . . ,n. By assumption, 
each aij € A'!. Therefore it follows from (3.5) that, for any pair of smooth 
gk,gi G M, ^i(gk,9i) is a nonnegative linear combination of elements of A4; 
hence Ti(gi,gj) G By Proposition 4.1 of Herbst and Pitt (1991), the 
semigroup {Pt'-t > 0} is known to preserve positive correlations; therefore 
each term P s (Ti(gi,gj) ■ g k ) - P s T 1 (g i ,g j ) ■ P s g k is nonnegative, establishing 
that (3.12) is a decomposition of h'(s) into nonnegative terms. This proves 
that h'(s) > 0, s G (0,t), from which we conclude that h(t) > 0. □ 

Remark 3.4. (i) Consider the case in which = 1 in the foregoing 
proof. Then the terms P s (T 1 (g i ,g j ) ■ g k ) - P s Ti(gi,gj) ■ P s g k are identically 
zero, so that no monotonicity conditions on the a,ij are required. Then we 
recover a result of Herbst and Pitt [(1991), Proposition 4.1]. 

(ii) By analogous arguments we can extend some results of Wang and 
Yan (1994) on positive correlations for diffusion processes on n-dimensional 
tori. 

(hi) Results similar to Proposition 3.3 have played a prominent role in 
the study of functional inequalities, for example, log-Sobolev inequalities; 
see Hu (2000). In work now in progress, we will study extensions of those 
inequalities by means of generalizations of Proposition 3.3. 

As a consequence of Proposition 3.3, we now obtain an analog of Theo- 
rem 1.1 for functions on IR n ; here again, we follow arguments given by Herbst 
and Pitt [(1991), Corollary 1.7] in their proof of the FKG inequality. 

Corollary 3.5. Suppose that n(dx) = exp(ijj(x)) dx is a probability 
measure on M n , where ip G C 2 (M n ) satisfies the properties that ip(x) > —00 
and didjip(x) > for all i 7^ j and for all x G W 1 . If f±, / 2 and f$ are 
nonnegative functions in M. such that the various expectations exist, then 

4(/i>/2,/ 3 )>o. 
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Proof. Construct the diffusion process {Pt-t > 0} with infinitesimai 
generator 

n n 
3=1 3=1 

corresponding to (3.3) with a^j = 25ij and bj = djijj. It is well known that 
the process {Pt :t > 0} is monotonic [by Theorem 3.1, this is equivalent to 
the assumption that didjtp(x) > for all x and i 7^ j]. Also, ri(/i,/2) > 
for all smooth f\, h 6 M and the etjj, being constants, clearly are elements 
of A4. Applying Proposition 3.3, we deduce that (3.8) holds. Letting t — > 00 
in (3.8) and noting that P t f — >E(/), almost surely, we obtain (1.7). □ 

It also is clear how to extend Proposition 3.3 to the case of four or more 
functions. We have found the algebraic calculations to be more extensive, 
indicating that an alternative approach is needed to resolve the case of an 
arbitrary number of functions. Nevertheless, the strategy is the same, and we 
illustrate it by sketching the details for the case of four functions. Suppose 
that /1, . . . , fi G M. are nonnegative and smooth. For x € W 1 , define g% = 
Pt-sfi, i = 1, • ■ • ,4 and 

h(s) = 6P s (g 1 g 2 g3g4) 

- 2[P s ( gi g 2 g 3 ) ■ P t U + Ps{gig294) ■ Ptf 3 

+ P s (9l9394) ■ Pth + P s {92939a) ■ P t fi] 

- [Psigm) ■ P s {g?,gi) + P s {gm) ■ ^(#254) + P s {gi9i) ■ ^(0253)] 
+ [P s (gm) ■ Ph ■ Ph + Ps(gm) ■ Ph ■ Ph 

+ P(gm) ■ Ph ■ Ph + P s (9293) ■ Ph ■ Ph 
+ P s {g29A) ■ Ph ■ Ph + ^(5354) • Ph ■ Pth] 
+ Pth- Pth- Pth- Ph, 

< s < t, where, as usual, we have suppressed all notational dependence of 
functions on x. Define 

^1(91,92,93,94) 

■= 9i92^i(93,9a) +9193^1(92,94) +9194^1(92,93) 
+ 9293^1(91,94) +9294^1(91,93) +9394^i(gi,92), 

and denote T 1 (g 2 , 93,94) by Yi({g x ,. . .,54} \ {gi}), and so on. For < s < t, 
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a lengthy, but straightforward, calculation leads to the result 
h'(s) = P s r 1 (g 1 ,g2,g3,g i ) 

4 

+ 2£[p s (< ?i -r 1 ({<?!,..., 54 } \ {#})) 

- p s ^- • p s ri({^i, . . . ,54} \ {*})] 
+ 51 [P s (gi9j ■ri({5i,...,54}\{ffi,5 J })) 

l<i<j<4 

- Ps(gi9j) ■ P s Ti{{gi, . .. ,54} \ 
+ 51 ■- p ^ r i({5i,---,5 , 4}\{5'i,5 , j})- 

l<i<j<4 

Now we apply the same arguments which concluded the proof of Proposi- 
tion 3.3. Then the above expression is seen to be a resolution of h'(s) into 
nonnegative terms; hence, under the hypotheses of Proposition 3.3, h'(s) > 0, 
< s < t, and therefore h(t) > h(0) = 0. Finally, we argue as in Corollary 
3.5 to deduce k'^ > 0. 

It is interesting to compare the techniques applied in Sections 2 and 3. 
On the one hand, the lattice formulation in Section 2 required no analytical 
considerations; however, the algebraic manipulations were sufficiently com- 
plicated that we found it necessary to use the MAPLE package to decompose 
expressions as sums of nonnegative terms. On the other hand, in the present 
section, analytical machinery was needed but the algebraic calculations ap- 
peared to be simpler and we made no use of the MAPLE package. 

4. Duplicate variables. Another approach to establishing the classical 
FKG inequality on ]R n is by way of the method of duplicate variables; see 
Cartier (1974) and Glimm and Jaffe (1987). Therefore it is natural to search 
for a proof of Theorem 1.1 using this method. For n = 1, we have such a 
proof; however, the inductive step seems difficult and we have not been able 
to find it. To demonstrate the difficulties inherent in the inductive step, we 
now establish the case in which n = 1 . 

Here, the functions fi, ji and fs are nonnegative and increasing on ]R and 
we wish to show that the conjugate cumulant ^(/i, /2, ./s) 1S nonnegative. 
By duplicating variables we have 

4(/i>/2)/3) = / I(x 1 ,x 2 ,x 3 )dfi(xi)dn(x 2 )diJ,(x 3 ), 
Jl 3 
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where 

I(x 1 ,x 2 ,x 3 ) 

= 2/i(x 1 )/ 2 (xi)/ 3 (xi) 

- [/i(x 1 )/ 3 (ari)/ 3 (x2) + fi(xi)f2(x 2 )h(xi) + /i(x 2 )/ 2 (x 1 )/ 3 (x 1 )] 
+ /i(^i)/ 2 (>2)/3(»3)- 

Define 

J(xi,x 2 ,x 3 ) = J(r • (xi, x 2 ,x 3 )); 

rG6 3 

then, by a symmetry argument, we obtain 

4(/li/2,/3) = / J(xi,x 2 ,x 3 )d/i(xi)d//(x 2 )d/4x 3 ). 

With the help of MAPLE we find that 

J(x 1 ,x 2 ,x 3 ) = / 3 (xi)(/i(x 3 ) - /i(xi))(/ 2 (x 3 ) - / 2 (xi)) 

+ /i(x 3 )(/ 2 (x 3 ) - / 2 (xi))(/ 3 (x 3 ) - / 3 (xi)) 
+ / 2 (xi)(/i(x 3 ) - /i(xi))(/ 3 (x 3 ) - / 3 (xi)) 

+ / 3 (xi)(/l(x 2 ) " /1(X!))(/ 2 (X 2 ) - / 2 (X!)) 
+ A(X 2 )(/ 2 (X 2 ) - / 2 (X!))(/ 3 (X 2 ) - / 3 (X!)) 
+ / 2 (xi)(/l(x 2 ) - /1(X!))(/ 3 (X 2 ) - / 3 (X!)) 

41 + /i(x 3 )(/ 2 (x 3 ) - / 2 (x 2 ))(/ 3 (x 3 ) - / 3 (x 2 )) 

+ / 3 (x 2 )(/i(x 3 ) - /i(x 2 ))(/ 2 (x 3 ) - / 2 (x 2 )) 
+ / 2 (x 3 )(/i(x 3 ) - /i(x 2 ))(/ 3 (x 3 ) - / 3 (x 2 )) 

+ (fl(x 3 ) - /l(x 2 ))(/ 2 (x 2 ) - / 2 (X!))(/ 3 (X 3 ) - / 3 (X 2 )) 

+ (fi(x 3 ) - /i(xi))(/ 2 (x 3 ) - / 2 (x 2 ))(/ 3 (x 3 ) - / 3 (x 2 )) 

+ (fl(x 3 ) - /1(X 2 ))(/ 2 (X 3 ) - / 2 (X 2 ))(/ 3 (X 2 ) - / 3 (X!)). 

Since /i, / 2 and / 3 are nonnegative and increasing, we deduce immedi- 
ately that J(xi,x 2 ,x 3 ) > on the "fundamental" chamber {x± < x 2 < x 3 }. 
It then follows by symmetry that J(xi,x 2 ,x 3 ) > on every Weyl chamber 
{x r (!) < x T ( 2 ) < x T ( 3 )}, r € (5 3 . By taking limits as (xi,x 2 ,x 3 ) goes to a 
boundary {x T m < x T ( 2 ) < x t(3)}i r € S 3 , we find that J(xi,x 2 ,x 3 ) remains 
nonnegative on every wall of a chamber. Therefore k 3 (/i,/ 2 ,/ 3 ) > 0. 

To carry out the inductive step toward higher dimensions requires a 
method for handling functions of the form J in (4.1); unfortunately, we have 
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not been able to develop such a technique. More importantly, our usage of 
the terminology "Weyl chamber," "wall" and "fundamental chamber" is not 
accidental, for we believe that inequalities of FKG-type exist within the 
context of arbitrary finite reflection groups; see Gross and Richards (1995). 

5. Applications. Any generalization of the FKG inequality has the po- 
tential for vast numbers of applications. In this section we provide a few 
applications and point the reader to others. We begin with generalizations 
of some applications given by Seymour and Welsh (1975). 

5.1. Generalized inequalities for Bernstein polynomials. For / E C[0, 1], 
the Bernstein polynomial of / is the polynomial defined on [0, 1] by 

Generalizing Theorem 2.6 of Seymour and Welsh (1975), we have the fol- 
lowing result. 

Proposition 5.1. Iffi, f 2 and are nonnegative increasing functions 
on [0, 1] , then their Bernstein polynomials satisfy 

2B n (hf 2 h){x) 

- [B n {hf 2 ){x)B n h{x) 

(5.1) 

+ B n {hh){x)B n f 2 (x) + B n f 1 (x)B n (f 2 f 3 )(x)} 
+ B n f 1 (x)B n f 2 (x)B n f 3 (x) >0 

for all x € [0, 1] . 

Proof. Let A be a set of n elements and let x G [0, 1]. Define a proba- 
bility measure /i on the lattice 2 A by 

H(a) = cx card(a) (l - x )"- card («), 

a E 2 A , where card (a) denotes the cardinality of a and c is the normalizing 
constant; it is well known that [i satisfies the MTP2 condition (2.1). Next 
we define 

fj( a ) = fj(card(a)/n), 

j = 1,2,3, a £ 2 A . Then the functions fj are nonnegative and increasing on 
2 A . On applying Theorem 1.1 to the probability measure fi and the functions 
fj, j = 1,2,3, we obtain (5.1). □ 

By arguing along similar lines, inequalities for Bernstein polynomials for 
four and five functions can be obtained through application of Theorem 1.3. 
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5.2. Inequalities for log-convex sequences. Recall that a sequence of real 
numbers {ao, • . • , a n } is called log-convex if a\ < a k -\a k+ i for all k = 1, . . . , n — 1. 

For any sequence {cto, ■ ■ ■ ,a„}, we will use the notation 

n 

{Wk}) :=^a k . 

k=0 

The following result extends Theorem 3.2 of Seymour and Welsh (1975), 
which itself is a generalization of a classical inequality of Tchebycheff. 

Proposition 5.2. Suppose that {ao, • • • , a n } is a positive, log-convex se- 
quence with ({a k }) = 1- Suppose also that the sequences {ao, • • • , a n }, {/3o, ■ • ■ , f3 n } 
and {70, . . . ,7n} ore increasing and nonnegative. Then 

2({a k a k (3 k -f k }) - {{a k a k /3 k })({a k -f k }} 

(5.2) - ({a k a k -f k })({a k (3 k }) - {{a k a k })({a k (3 k ~/ k }} 
+ {{a k a k })({a k (3 k })({a k j k }) > 0. 

Proof. The proof of this result is similar to the proof of the correspond- 
ing result given by Seymour and Welsh (1975). Define the sequence 

** = «*/(*)' 

k = 0, . . . ,n. Since the sequences {a k } and {l/(^)} are log-convex, then so 
is {b k }. 

Let A = {1, ... , n} and define n : 2 A -> R by 

H(a) = 6 car d(a) 5 

a G 2^. It is shown by Seymour and Welsh (1975), that fi satisfies (2.1). 
Define fr, f 2 , f 3 :2 A by 

fl (a) = a card ( a ) , h{o)= /3card(a) , /3 ( a ) = 7card(a) > 

a € 2^. Then the functions /j are nonnegative and increasing on 2 A . On 
applying Theorem 1.1 to the probability measure proportional to fi and the 
functions fj, j = 1,2,3, we obtain the desired result. □ 

5.3. A generalization of Kleitman" 's lemma. A collection C of subsets of 
a set A is closed above if a G C and a C 6 imply 6 G C. Similarly, C is closed 
below if a G C and a D b imply b G C. 

Suppose that A is a finite set of cardinality n, and let U and L be col- 
lections of subsets of A such that U is closed above and L is closed below. 
Then Kleitman (1966) proved the remarkable inequality, 

(5.3) 2 n card(C7 n L) < card(C/) card(L) . 
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The following result generalizes this inequality; see Seymour and Welsh 
[(1975), Theorem 4.2]. 

Proposition 5.3. Let A be a finite set of cardinality n, and let U±, U 2 , 
and L be collections of subsets of A. Suppose that U\ and U 2 are closed above 
and L is closed below. Then 

2 2n card(C/i n U 2 ) - 2 2n+1 card(C/i n U 2 H L) 

+ 2 n (card([/i n[/ 2 )card(L) 

(5.4) 

+ card([/i n L) card(C/ 2 ) + card(C/ 1 ) card(^7 2 n L)) 
- 2 n card(C/i) card(f/ 2 ) - card(E/i) card(C/ 2 ) card(L) > 0. 

Proof. Let /1 be the uniform distribution on 2 A , that is, fi(a) = l/2 n 
for all a S 2 A . Let /1 and / 2 be the characteristic functions of the sets Ui 
and U2, respectively; that is, for j = 1, 2, 

/jW-\0, ifo^Uj. 

Also let /3 denote the characteristic function of L c , the complement of L. 
Then fi, / 2 and 1 — /3 are nonnegative increasing functions. Then (5.4) is 
obtained by applying Theorem 1.1 to the measure \i and the functions fi, 
f 2 and 1 - / 3 . □ 

For the case in which U± = U is closed above and U 2 = 2 A , the inequality 

(5.4) reduces to (5.3). 

5.4. Inequalities for matrix functions. 

Definition 5.4. Let R= (R(i,j)) be an n x n real matrix. Then 

(i) R satisfies the triangle property if 

(5.5) R(i,j)R(k, k) = R(i, k)R(k,j) 

for all i < k < j and all j < k < i [see Barrett and Feinsilver (1981)]; 

(ii) R is nonnegative if R(i,j) is nonnegative for all i and j; 

(iii) R represents a discrete probability distribution if R is nonnegative 
and£^ =1 #(i,j) = l; 

(iv) R is increasing if R(i,j) is monotone increasing in i and in j. 

For any n x n matrix R = R(i,j) representing a discrete probability dis- 
tribution and any n x n matrix F = (F(i,j), we denote ti(RF') by Er(F). 
Because 

n n 

tt(RF')=Y i Y i R(i ) j)F(iJ), 
i=tj=i 
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the notation E R (F) for tr (RF r ) is consistent with the notation for expecta- 
tion with respect to the discrete probability distribution represented by the 
entries of the matrix R. 

Given two n x n matrices F\ = (fi(i,j)) and F2 = (f2(hj)), recall that 
the Hadamard product of F\ and F2 is the matrix F\ o F2 = (fi(i,j)f2(i,j))- 

Now we have the following result. 

Proposition 5.5. Let R be an nxn matrix which represents a discrete 
probability distribution and satisfies the triangle property. If F\ = (fi(i,j)), 
F2 = (faihj)) an d F3 = (f 3 (i,j)) are nonnegative increasing nxn matrices, 
then 

2E R (F 1 o F 2 o F 3 ) - [E R (F 1 o F 2 )E r (F 3 ) 

(5.6) +E R (F 1 o F 3 )E R (F 2 ) +E R (F 1 )E R {F 2 o F 3 )} 
+ E R {F 1 )E R (F 2 )E R {F 3 )>0. 

In particular, 

(5.7) E fl (Fi o F 2 ) > E R (F 1 )E R (F 2 ). 

Proof. Let A = {1, . . . ,n}, so that A x A is the set of pairs of positive 
integers ranging from 1 through n. Equip Ax A with the partial order given 
by (i>j) "4 {k, I) if i < j and k < I; this is the same partial ordering utilized in 
the study of the FKG inequality on Euclidean space. For pairs p = and 
q = {k, I) in A x A, we define p V q and p A q in the usual component-wise 
manner: 

p V q= ( max(i, /c), max(j, Z)), p Aq = (min(z, fc), min(j, I)). 

It is straightforward to check that the triangle property (5.5) is equivalent 
to the MTP2 condition with equality: 

R(pVq)R(pAq)=R(p)R(q) 

for all p, q € A x A. Therefore, any matrix R which represents a discrete 
distribution and satisfies the triangle condition corresponds to an MTP2 
probability distribution on Ax A. This observation is the crux of the proof, 
for we can now apply Theorem 1.1 to nonnegative increasing matrix func- 
tions to deduce (5.6). Finally, (5.7) is the special case of (5.6) in which 
fs{i,3) = l. □ 

We now turn to inequalities for rank and determinant functions of positive- 
definite functions. 
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Let M denote a fixed n x n positive-semidefinite matrix M. Given an 
index set a C A = {1, . . . ,n}, we denote by M[a] the submatrix of M ap- 
pearing in the rows and columns labelled by the elements of the set a. For 
any a,iCA, it is a result of Lundquist and Barrett (1996) that 

(5.8) rankMfa U b] + rankMfa n b] < rank M[a] + rank M [6]. 
For fixed t > 0, define the probability measure [i on 2 A by 

jVi— rank M [a] 
= J2 CA t«-rank M[a] ' 

a € 2^; the measure can be viewed as a generating function for the ranks 
of the submatrices M[a], a C A. 

It follows from (5.8) that \x satisfies the MTP2 condition (1.1) on the 
lattice 2 A . By applying Theorem 1.1 to the measure fj, and nonnegative 
increasing functions fj, j = 1,2,3, we obtain various positivity results. We 
leave it to the reader to work out special cases, for example, the case in 
which the fj are defined in terms of the characteristic functions of subsets 
of 2 A that are closed above or below, as necessary. 

Other examples arise by specifying the /,• to be functions analogous to 
those chosen by Seymour and Welsh (1975) in the proof of their Theo- 
rem 5.10. More generally, we may consider the context of matroid theory 
considered by Seymour and Welsh (1975) and deduce by application of The- 
orem 1.1 higher-order total positivity properties of the rank-generating func- 
tion of a matroid. 

To derive determinantal inequalities, suppose that M is a fixed positive- 
definite symmetric n x n matrix. We recall the generalized Hadamard- 
Fischer inequality 

(5.9) detM[aU&]detM[anfr] < det M[a] detM[6]; 

see Horn and Johnson [(1985), page 485]. For fixed t > 0, define the proba- 
bility measure 

detM[o] -t 
Ka) ~Z a cA det M[a]^ 

a € 2 A . Then fi satisfies the MTP2 condition (1.1) on the lattice 2 A . On 
applying Theorem 1.1 to various choices of the functions fj, we obtain de- 
terminantal inequalities. 

For example, suppose fi(a) = tr M[a], the trace of M[a]; /2(a) = A m i n (M[o]), 
the smallest eigenvalue of M[o]; and /3(a) = l/A max (M[a]), the inverse of 
the largest eigenvalue of M[a]. Then it is a consequence of the fundamental 
inclusion principle describing the interlacing properties of eigenvalues of sub- 
matrices of Hermitian matrices [Horn and Johnson (1985), Theorem 4.3.15, 
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page 185] that these fj are all increasing functions on the lattice 2 . There- 
fore we may obtain eigenvalue inequalities by application of Theorem 1.1. 
As a special case, by applying the FKG inequality to the functions /2 and 
/3, we obtain the inequality 



5.5. Monotonicity properties of partial orders. Suppose that (ai, . . . , a m , 
&1, . . . , b n ) is a uniformly distributed random permutation on & m + n , the set 
of permutations onm + n symbols. The permutation is to be viewed as the 
actual ranking of tennis skills of players ax, ... , a m , b\, . . . , b n . We suppose 
that a player x always loses to a player y in a match if x < y. In a contest 
between teams A = {ai, . . . , a m } and B = {b\, . . . , b n }, suppose that there is 
a partial order G between certain a's and between certain b's, for example, 
a% < CL2, a\ < 03, 62 < bi, . . . , but no information about relative rankings 
between any and bj. Such a situation arises if, to date, there have been 
numerous intrateam matches resulting, for example, in a\ losing to 02, a\ 
losing to 03, 62 losing to b\, and so on, but no interteam matches. We 
denote by P(a\ < &i[0) the conditional probability that b\ defeats a\ given 
the partial order 0. 

Following numerous matches between members of the A and B teams, 
suppose the result has been a victory for B in every case. Thus we now have 
information that the a's have so far lost each match to the Vs. This induces 
a new partial order, 9' = 6U 0", where 0" consists of inequalities of the 
form 04 < bj for some collection of i and j. We denote by P(a± < b\ |0') the 
conditional probability that b\ defeats a\ given the information in 0'. 

Graham, Yao and Yao (1980) [see Graham (1982, 1983)] proved that 



and Shepp (1980) later gave another proof using the FKG inequality. As 
observed by Shepp, the additional knowledge with 0' that a number of a's 
severally have lost to a l?-team member provides the basis for us to infer 
that the A-team is jointly inferior to the B-team. Therefore it is natural to 
expect a higher probability conditional on 0', than conditional on 0, that 
ai loses to b\ and (5.10) confirms this expectation. 

Shepp (1980) constructed a suitable finite distributive lattice, a nonnega- 
tive MTP2 measure n and two decreasing indicator functions / and g. Then 
he deduced (5.10) by an application of the FKG inequality. Along the same 





(5.10) 



P(ax <6i|0') >P{a x < bx |0) 
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lines, we obtain generalizations of (5.10) using Shepp's MTP2 measure by 
constructing three or more decreasing indicator functions fj and applying 
Theorem 1.1 or 1.3 to the functions 1 — fj. As an example, we state a result 
which follows from Theorem 1.1. 

Proposition 5.6. Let A be the subset of 6 m +n for which A and B 
have the complete order: 



Suppose that A\,...,A± are subsets o/G m + n , each of which is an intersection 
of subsets of the form ai < bj. For any A C 6 m+n , define 



Then 

2it(Ai nA 2 nA 3 ) 

- [ir(Ai n A 2 )ir(A 3 ) + ir(Ai n A 3 )ir(A 2 ) + tt(A 1 )7t(A 2 n A 3 )] 
+ Tr(A 1 )Tr(A 2 )n(A 3 ) 

- 2[n(Ai n A 2 ) - tt(Ai)tt(A 2 ) + tt(Ai n ^4 3 ) 



By similar arguments, we can also generalize related results of Shepp 
(1980) and the XYZ conjecture [Shepp (1982)]. 

5.6. Cumulants inequalities for probability distributions. Our initial mo- 
tivation for investigating the higher-order inequalities of FKG-type was to 
generalize numerous correlation inequalities well known for MTP2 proba- 
bility distributions in mathematical statistics; see Eaton (1987) and Karlin 
and Rinott (1980). We shall leave it to the reader to deduce from Theorems 
1.1 and 1.3 bounds on the third-, fourth-, and fifth-order cumulants of those 
probability distributions. 

The basis for much of Section 4 of Karlin and Rinott (1980) is a special 
case of the following result. 

Corollary 5.7. Let <p and ip be MTP 2 functions on W 1 and define 



for any function f for which the integrals converge. If fj, j = 1,2,3, are 
nonnegative increasing functions on R n , then ^{fi, $2, fs) > 0. 



A Q = {«!<••• < a m } n {61 < • • • < b n }. 



tt(A): 



p(A ni 4 ni) 
P{A n A A ) 



7r(Ai)7r(A 3 ) + tt(A 2 n A 3 ) - tt(A 2 )tt(A 3 )] < 0. 



E(/): 



f (p{x)il){x)f(x)dx 
f ip(x)tj}(x) dx 



30 



D. ST. P. RICHARDS 



The proof of this result follows immediately from the generalized third- 
order FKG inequality once we note that the probability measure which is 
proportional to (p(x)t/j(x) dx, x E R n , is an MTP2 measure. As applications 
of this result we can then deduce higher-order probability inequalities for 
any MTP2 random vector generalizing, for example, Example 4.1 of Karlin 
and Rinott (1980). In closing this section we provide, as a generalization of 
Proposition 4.1 of Karlin and Rinott (1980), a higher-order log-concavity 
property of exchangeable random variables. 



Proposition 5.8. Let X±, . . . , X n be exchangeable random variables hav- 
ing a joint MTP2 probability density function (p. For a£K, define co(a) = 1 
and 



c m (a) =P(X 1 <a,...,X m <a) 



1 < m < n. Then 



(5.11) 



r, c m+2 (a) _ 3 c m+ i(a) c m (a) 
Cm-lip) 



Cm-i(a) c m _i(a) 



+ 



c m (a) 3 

Cm-l(a) 3 



Cm+i(«) c m (a) 5 



Cm-i(a) c m _i(a) 2 



<0, 



m = 1, . . . ,n — 2. 



Proof. For x € R n , define 



1, if xi,...,x m -i <a, 
0, otherwise. 



It is well known that ip is MTP2. For k = 1,2, 3, define 
fk{x) 



1, if ^m+fe-l < a , 
0, otherwise. 

Then the functions /1, /2 and fs are decreasing. Moreover, E(/fe) = c m (a)/ 
Cm-i(a), = 1,2,3; E(/i/ 2 ) = E(/i/ 3 ) = E(/ 2 / 3 ) = c m+1 (a)/c m _i(a); and 
E(/i/2/3) = c m+2 (a)/c m _i(a). 

Now we apply Corollary 5.7 to the functions 1 — /1, 1 — /2 and 1 — / 3 . 
Simplifying the resulting expression, we obtain (5.11). □ 



By applying Proposition 5.8, we can obtain generalizations of other ex- 
amples given by Karlin and Rinott [(1980), Section 4]. 
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6. Remarks on total positivity and inequalities of FKG-type. In the de- 
velopment of inequalities of FKG-type, it will be instructive to study the 
case of indicator functions on R 2 . In what follows, for any a E R, we use the 
notation 

1, t > a, 
0, t<a, 

for the indicator function of the interval [a, oo). 

A prototypical increasing function / on R 2 is an indicator function of a 
"northeast" region, [a,co) x [b, oo), so that / is of the form 

1, u> a,v >b, 



Ia(t) 



(6.1) 



f(u,v) =I a (u)I b (v) 



0. 



otherwise, 



for some a,b € R. Let us consider the case of two such functions fj(u,v) = 
hji^hjiv), (u,v) G R 2 , where a,j,bj G R, j = 1,2. In establishing the FKG 
inequality for these functions we may assume, by symmetry, that a\ < a 2 . We 
denote by (X\, X 2 ) the random vector corresponding to the density function 
K. 

Suppose that b\ <b 2 . Then 

Ef 1 = P(X 1 >a 1 ,X 2 >b 1 )<l 

and 

Ehh = Eh = P(Xi > 02, X 2 > b 2 ) > 0. 

Therefore 

Cav(A, f 2 ) = Ehh - (Eh)(Ef 2 ) = Ef 2 - (Eh)(Ef 2 ) 
= (Ef 2 )(l-Ef 1 )>0. 
Next suppose that b\>b 2 . Then 

Cov(f 1 ,f 2 ) = Ef 1 f 2 -(Ef 1 )(Ef 2 ) 
= P(X 1 >a 2 ,X 2 >b 1 ) 

-P(X 1 >a 1 ,X 2 >b 1 )P(X 1 >a 2 ,X 2 >b 2 ) 

= P(X 1 > ai ,X 2 > h)[l - P{X l > a u X 2 > b 2 )] 

P(X 1 >a 1 ,X 2 >b 2 ) P(X 1 >a 1 ,X 2 >b 1 ) 
P(X 1 >a 2 ,X 2 >b 2 ) P(X 1 >a 2 ,X 2 >b 1 ) m 

The first term in (6.2) clearly is nonnegative, so it remains to establish 
nonnegativity of the second term. To that end, we write 

P(X 1 >a 1 ,X 2 >b 2 ) P(X 1 >a 1 ,X 2 >b 1 ) 
P(X 1 >a 2 ,X 2 >b 2 ) P(X 1 >a 2 ,X 2 >b 1 ) 



(6.2) 



(6.3) 



/ I ai (u)Ib 2 (v)K(u, v) du dv j I ai (11)1^ (v)K(u, v) du dv 
J I a2 (u)Ib 2 (v)K(u, v) du dv J I a2 (u)Ib! (v)K{u, v) du dv 



•32 



D. ST. P. RICHARDS 



By two applications of the basic composition formula [Karlin (1968), page 17], 
once in u and once in v, we see that (6.3) reduces to 



(6.4) 



Ul<U2 V\<V 2 



I ai (ui) I ai (u2) 



K{ui,v{) K(u 1 ,v 2 
K(u 2 ,vi) K(u 2 ,v 2 



du\ dvi du 2 dv 2 . 



The determinant det(I ai (xj)) is well known to be nonnegative if x\ < x 2 and 
ai < a 2 ; indeed, the set of functions {I ai ,I a2 :a\ < a 2 } is an example of a 
weak Tchebycheff system [cf. Karlin (1968), Chapter 1]. Therefore the first 
two determinants in (6.4) are nonnegative on the region {u± < u 2 ,v\ < v 2 }. 
Since K is TP2, then the third determinant also is nonnegative on the same 
region. Hence Cov(fi(X 1 ,X 2 ), / 2 (*i,* 2 )) > 0. 

Next suppose that f\ and f 2 are functions on IR n of the form 

(6.5) f j (x 1 , ...,x n )= I ajil (x 1 )I a . >2 (x 2 ) ■ ■■I ajn {x n ), 

for j = 1, 2. We assume, by induction, that the FKG inequality holds for all 
functions of the above form for all dimensions up to n — 1. Thus, we have 

(6.6) f j (x 1 ,...,x n ) = g j (xi,...,x n -i)I ajin (x n ), 
where gj(xi,.. .,x n -i) = I ajtl (xi) ■ • ■ I aji „_i {x n -i), j = 1,2, and 

Cox{ 9l {X u . . .,X n - 1 ),g 2 (X 1 , . . . > 0. 

Using the standard method based upon conditional expectations, we now 
complete the inductive step; we underscore that this inductive step is well 
known and we provide details only for the sake of completeness. Denoting 
by Ex lt ...,x n -i\x n expectation with respect to the probability distribution of 
the random variables X\, . . . , X n _i conditional on X n , it follows from (6.6) 
and the law of total probability that 

, s Efif 2 = E Xn E x x \ X Jxf 2 
6.7 

2 

= Ex n Ia hn (X n )Ia 2tn (X n )E Xu ... : x n „ 1 \x n II 9j(Xi, . . .,X n -i). 

3=1 

Since (X 1 ,...,X n ) is MTP 2 , then (Xi, . . . ,X n -i)\X n is also MTP 2 [see 
Sarkar (1969) and Karlin and Rinott (1980)]. Therefore, by inductive hy- 
pothesis, 

E x l ,...,x n - 1 \x„9i92 > (E Xl ,...,x„- l \x n gi)(Ex 1 ,...,x n - 1 \x n 92), 
and then it follows from (6.7) that 

Eflf 2 >E Xn MXn)MXn), 
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where 

ipj{x n ) = I ajn {x n )E Xl ^ Xn _ 1 \ Xn=Xn g j {X 1 ,. . .,X n -i), 

j = 1,2. Since {X\ , . . . , X n )\X n is MTP2 and gj is increasing, then [cf. Sarkar 
(1969) and Karlin and Rinott (1980), page 484, Theorem 4.1] the function 
Ex u -,x n - 1 \X n =x n 9j(Xi, ■ ■ -, x n-i) is increasing in x n ; hence Vi and ip 2 are 
both increasing, so that Eipiip 2 > EipiEfa- Therefore we obtain 

Ehh > Efoih > EfaEih = EftEf 2 , 

and the proof of the FKG inequality for functions of the type (6.5) is com- 
plete. 

Having established the FKG inequality (1.2) for all indicator functions 
of the form (6.5), we observe that the functional (/i,/2) — > Cov(/i,/2) is 
bilinear. Hence (1.2) holds for all functions f\ and f 2 on ]R n of the form 

r n 

f(xi,...,X n ) = Y[l aij (xj), 

i=l j=l 

(xi, . . . , x n ) G W 1 , where r G N and Cj > 0, i = 1, . . . ,r. Denoting by M+(R n ) 
the set of positive Borel measures on R 2 , it then follows by approximation 
arguments that the FKG inequality (1.2) holds for all functions f\ and f 2 
of the form 

„ n 

(6.8) f(xi,...,x n )= T I a Axj)du(ai, . . . ,a n ), 

( Xl ,...,x n ) €M n , where v£M + (M n ). 

It is well known that the set of functions of type (6.8) is a proper sub- 
set of the class of increasing functions on M n and contains all cumulative 
distribution functions on M n . 

Turning to the third-order FKG inequality in Theorem 1.1, we can also 
establish that result for the class of functions (6.1). Suppose that we have 
three indicator functions, fj(u,v) = I a . (u)ify (v ), (u,v) E ]R 2 , j = 1,2,3. To 
establish Theorem 1.1 for f%, f 2 and f$, we may assume, by symmetry, 
that a\ < a 2 < 03. Then the proof requires that we resolve six cases, each 
corresponding to an ordering of b±, b 2 and 63. In what follows, we shall 
denote P(X\ >cii,X 2 > bj) by pij. 

Case 1 b\ < b 2 < 63. In this case, fifj = /, for i < j. Therefore 

«3 = 2 /°33 - [P22P33 + P33P22 + PII/O33] + PllP22P33 
= (2-Pii)(1-P22)P33, 

which, clearly, is nonnegative. 
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Case 2 bi < b 3 < b 2 . Here, we have 

^3 = 2p 32 - [/022/533 + P33P22 + P11P32} + P11/022P33 
= (2 - Pll)(p 3 2 - P33P22)- 

By the FKG inequality, p 32 - P33P22 = E(/ 3 / 2 ) - E(/ 3 )E(/ 2 ) > 0; therefore 
4 > 0. 

Case 3 62 < b\ < 63. In this case, we have 

«3 = 2 P33 - [P21P33 + P33P22 + Pll/>33] + PllP22P33 
= P33[1 ~ P21 + (1 -Pll)(l -P22)], 

which, clearly, is nonnegative. 
Case 4 b 2 < b 3 < b±. Here, 

K 3 = 2 P31 - [/021P33 + P31P22 + PllP33] + PllP22P33 
= (1 - p 2 2)(P31 - P33Pll) + P31 ~ P21P33- 

By the FKG inequality, both p 31 - p 3 3Pn = E(/ 3 /i) - E(/ 3 )E(/i) and p 31 - 
P33P21 = E(/ 3 / 2 /i) - E(/ 3 )E(/ 2 /i) are nonnegative. Therefore k' 3 > 0. 

Case 5 b 3 < b\ < b 2 - In this case, we have 

K 3 = 2p 32 - [P22P33 + P31P22 + PIIP32} + P\\P22P33 
= (1 - Pll)(P32 - P33P22) + P32 ~ P31P22- 

By the FKG inequality, both p 32 - P33P22 = E(/ 3 / 2 ) - E(/ 3 )E(/ 2 ) and p 32 - 
p 3 ±p 22 = E(/i/ 2 / 3 ) - E(/ 3 /i)E(/ 2 ) are nonnegative. Therefore k' 3 > 0. 

Case 6 b 3 <b 2 <b±. In this case, we have 

K 3 = 2 P3i - [P21P33 + P31P22 + P11P32] + P11P22P33 

= (1 - p 2 2)(P31 - P33/Oll) +P33(1 - P2l) +Pll(P33 ~ P32)- 

By the FKG inequality, p 31 - p 33 p u = E(/ 3 /i) - E(/ 3 )E(/i) > 0. Also, since 
1 - / 2 > 0, p 33 - p 32 = E(/ 3 ) - E(/ 2 / 3 ) = E/ 3 (l - / 2 ) > 0. Therefore k' 3 > 0. 

Having resolved the case in which the fj are indicator functions, we ap- 
ply the multilinearity of the functional (/i,/ 2 ,/ 3 ) — > « 3 (/i, / 2 , /s), and an 
approximation argument, to deduce nonnegativity of ac 3 (/i, / 2 , / 3 ) for all /j 
of the class (6.8) with n = 2. 

In the case of K4, we have carried out the case-by-case analysis (consisting 
of 24 cases); as regards k' 5 , we unhesitatingly entrust the analysis (of all 120 
cases) to the reader. 
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Even in the case of the classical FKG inequality, the method of indicator 
functions appears to be new. The technique has the obvious drawback that 
it yields the classical FKG inequality only for a proper subset of the class 
of all increasing functions, and that too by way of a case-by-case analysis. 
However, the method has the advantage that it points the way toward gen- 
eralizations of that inequality; indeed, the method of indicator functions is 
the means by which we first discovered instances of the nonnegativity of k' 3 . 

In developing the method of indicator functions, the appearance of the 
Binet-Cauchy formula is noteworthy. To explain, we first remark that in 
a previous paper, Gross and Richards (1995) developed a Binet-Cauchy 
formula in the context of finite reflection groups and obtained an analog of 
the FKG inequality for one particular reflection group. We speculate that 
analogs of those inequalities exist in any context in which Binet-Cauchy 
formulas are available, including those of Karlin and Rinott (1988). 

A general context in which formulas of Binet-Cauchy type arise is given 
by Selberg (1956) in a fundamental paper laying the groundwork for the 
general theory of what is now known as Selberg' s trace formula. Let S denote 
a Riemannian space, with local coordinates t ,...,t n and positive-definite 
metric 



where the functions gij are analytic in t , . . . ,t n . We assume that we have 
a locally compact group G of isometries of S, with a transitive action. A 
function K : S x S — > C is point-pair invariant if K(gx, gy) = K(x, y) for all 
x,y € S and g G G. One of the basic problems considered by Selberg (1956) 
is the analysis of the spectral theory of the integral operator 



where dy denotes the invariant measure on S derived from the metric ds 2 
and / is a suitable function. 

Let r denote a discrete subgroup of G which acts properly and discontinu- 
ously on S. Denote by 2) the fundamental domain of T. With U(d) denoting 
the group of unitary d x d matrices, let x '■ F — > U(d) be a unitary represen- 
tation of r. Let F be a complex, vector-valued function on S such that, for 
all x € S and M G T, F(Mx) = x(M)F(x). Then it is a readily established, 
but important, result that 



ds 2 = Y / g ij dt i dt j 





where 



(6.9) 
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see Selberg [(1956), page 59]. Formally, there exists an eigenfunction expan- 
sion of K x : 

K x (x,y)=Y^c i F l (x)F i (y)\ 

where the c% are constants, the Fi are eigenfunctions of a class of differential 
operators invariant under the action of G, and Fi(y)* is the transpose of 
the complex conjugate of Fi(y). What is of interest to us here is that the 
function K x possesses properties of the determinant function [defined by 
Gross and Richards (1995)] for finite reflection groups. Indeed, it follows 
from (6.9) that 

K x (x,My) = K x (x,y) X (M- 1 ) 

for all x,y G S, ME T; this property generalizes the familiar sign-change 
behavior of the classical determinants under the interchange of rows or 
columns. The function K x also satisfies a generalized Binet-Cauchy formula: 
If K and L are point-pair invariant functions, and 

Q(x,y) := f K(x,t)L{t,y)dt 
Js 

then Q is point-pair invariant and 

Q x ( x >y) = / K x( x > f ) L x fay) dt - 

J® 

We conjecture that inequalities of FKG-type exist in this context. Owing to 
the potential for applications in multivariate statistical analysis, of special 
interest for us will be the case in which S is the space of positive-definite 
symmetric matrices. 
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